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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Robertson et al . 

(ii) TITLE OF INVENTION: Catalases 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROS ELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 
p (D) SOFTWARE: WORD PERFECT 5.1 
. f~ 

^vi) CURRENT APPLICATION DATA: 

( A ) APPLICATION NUMBER: 08/674,887 
fil (B) FILING DATE : July 3, 1996 

n! (C) CLASSIFICATION: 

fyii) PRIOR APPLICATION DATA: 

Ql (A) APPLICATION NUMBER: 

[|1 (B) FILING DATE: 

ATTORNEY/ AGENT INFORMATION: 
S (a) NAME: Charles J. Herron 

ffi (B) REGISTRATION NUMBER: 28,019 

™ (C) REFERENCE / DOCKET NUMBER: 331400-55 

n^ix) TELECOMMUNICATION INFORMATION: 

Q (A) TELEPHONE: 201 r 994-1700 

[1 (B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO: Is 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: 'NUCLEIC ACID 

( C ) STRAND EDNESS : S INGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAATAAC GCATCCGCTG AC 



(2) INFORMATION FOR SEQ ID NO: 2: 

<i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEIC ACID 

(B) TYPE: NUCLEOTIDES 

(C) STRAND EDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
CGGAAAGCTT TTACGACGCG ACGTCGAAAC G 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGAAAAT CACAAACACT 



(SS INFORMATION FOR SEQ ID NO: 4: 

""5 (i) SEQUENCE CHARACTERISTICS 

%LI (A) LENGTH: 31 NUCLEOTIDES 

fy (B) TYPE: NUCLEIC ACID 

(c) STRANDEDNESS: SINGLE 
(D) TOPOLOGY: LINEAR 



CB (ii) MOLECULE TYPE: DNA 

U§ (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
cSaggtacc TTATTTCAGA TCAAACCGGT C 



(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



FLI 



INFORMATION FOR SEQ ID NO: 5: 




ATG AAT AAC GCA TCC GCT GAC 
Met Asn Asn Ala Ser Ala Aap 

5 



GAT CTA CAC AGT 
Asp Leu His Ser 
10 



AGC TTG CAG CAA AGA 
Ser Leu Gin Gin Arg 
IS 



TGC AGA GCA TTT GTT CCC TTG 
Cys Arg Ala Phe Val Pro Leu 
20 



GTA TCG CCA AGG 
Val Ser Pro Arg 
25 



CAT AGA GCA ATA AGG 
His Arg Ala lie Arg 
30 



GAG AGA GCT ATG AGC GGT AAA 
Glu Arg Ala Met Ser Gly Lys 
35 



TGT CCT GTC ATG 
Cys Pro Val Met 
40 



CAC GGT GGT AAC ACC 
His Gly Gly Asn Thr 
45 



TCG ACC GGT ACT TCC AAC AAA GAT TGG TGG CCG GAA GGG TTG AAC CTG 192 
Ser Thr Gly Thr Ser Asn Lys Asp Trp Trp Pro Glu Gly Leu Asn Leu 
50 55 60 

GAT ATT TTG CAT CAG CAA GAT CGC AAA TCA GAC CCG ATG GAT CCG GAT 240 
Asp He Leu His Gin Gin Asp Arg Lys Ser Asp Pro Met Asp Pro Asp 
65 70 75 80 

TTC AAC TAC CGT GAA GAA GTA CGC AAG CTC GAT TTC GAC GCG CTG AAG 288 
Phe Asn Tyr Arg Glu Glu Val Arg Lys Leu Asp Phe Asp Ala Leu Lys 
85 90 95 

AAA GAT GTC CAC GCG TTG ATG ACC GAT AGC CAA GAG TGG TGG CCC GCT 336 
Lys Asp Val His Ala Leu Met Thr Asp Ser Gin Glu Trp Trp Pro Ala 
100 105 HO 

GAC TGG GGG CAC TAC GGC GGT TTG ATG ATC CGT ATG GCT TGG CAC TCC 384 
A Trp Gly His Gly Gly Leu Met He Arg Met Ala Trp His Ser 

115 120 125 

GCT GGC ACC TAC CGT ATT GCT GAT GGC CGT GGG GGC GGT GGT ACC GGA 432 
Ala Gly Thr Tyr Arg He Ala Asp Gly Arg Gly Gly Gly Gly Thr Gly 
130 135 140 

AGC CAG CGC TTT GCA CCG CTC AAC TCC TGG CCG GAC AAC GTC AGC CTG 480 
s a Gin Arg Phe Ala Pro Leu Asn Ser Trp Pro Asp Asn Val Ser Leu 
14g 150 155 160 

G^r AAA GCG CGC CGT CTG CTG TGG CCG ATC AAG AAG AAG TAC GGC AAC 
AIM Lys Ala Arg Arg Leu Leu Trp Pro He Lys Lys Lys Tyr Gly Asn 
H j 165 170 175 

All ATC AGC TGG GCA GAC CTG ATG ATT CTG GCT GGC ACC GTG GCT TAT 
LvS He Ser Trp Ala Asp Leu Met He Leu Ala Gly Thr Val Ala Tyr 
fn 180 185 190 

GAG TCC ATG GGC TTA CCT GCT TAC GGC TTC TCT TTC GGC CGC GTC GAT 
Gl4 Ser Met Gly Leu Pro Ala Tyr Gly Phe Ser Phe Gly Arg Vai Asp 
03 195 200 205 

pM tgg GAA CCC GAA AAA GAT ATC TAC TGG GGT GAC GAA AAA GAG TGG 
iiy Trp Glu Pro Glu Lys Asp He Tyr Trp Gly Asp Glu Lys Glu Trp 
Q 210 215 220 

CTC GCA CCT TCT GAC GAA CGC TAC GGC GAC GTG AAC AAG CCA GAG ACC 
Leu Ala Pro Ser Asp Glu Arg Tyr Gly Asp Val Asn Lys Pro Glu Thr 
225 230 235 240 

ATG GAA AAC CCG CTG GCG GCT GTC CAA ATG GGT CTG ATC TAT CTG AAC 
Met Glu Asn Pro Leu Ala Ala Val Gin Met Gly Leu He Tyr Val Asn 
24S 250 255 

CCG GAA GGT GTT AAC GGC CAC CCT GAT CCG CTG AGA ACC GCA CAG CAG 816 
Pro Glu Gly Val Asn Gly His Pro Asp Pro Leu Arg Thr Ala Gin Gin 
260 265 270 

GTA CTT GAA ACC TTC GCC CGT ATG GCG ATG AAC GAC GAA AAA ACC GCA 
Val Leu Glu Thr Phe Ala Arg Met Ala Met Asn Asp Glu Lys Thr Ala 
275 280 285 

GCC CTC ACA GCT GGC GGC CAC ACC GTC GGT AAT TGT CAC GGT AAT GGC 
Ala Leu Thr Ala Gly Gly His Thr Val Gly Asn Cys His Gly Asn Gly 
290 295 300 

AAT GCC TCT GCG TTA GCC CCT GAC CCA AAA GCC TCT GAC GTT GAA AAC 
Asn Ala Ser Ala Leu Ala Pro Asp Pro Lys Ala Ser Asp Val Glu Asn 
305 310 315 320 



528 



576 



624 



672 



720 



768 



864 



912 



960 



CAG GGC TTA GGT TGG GGC AAC CCC AAC ATG 
Gin Gly Leu Gly Trp Gly Asn Pro Asn Met 
325 330 

GCC GTG ACC TCG GGT ATC GAA GGT GCT TGG 
Ala Val Thr Ser Gly lie Glu Gly Ala Trp 
340 345 

TTC GAT ATG GGC TAT TTC GAC CTG CTG TTC 
Phe Asp Met Gly Tyr Phe Asp Leu Leu Phe 
355 360 

AAA AAG AGT CCT GCC GGT GCC CAC CAT TGG 
Lys Lys Ser Pro Ala Gly Ala His His Trp 
370 375 

AAG GAA AAC AAG CCG GTT GAC GCC AGC GAC 
Lys Glu Asn Lys Pro Val Asp Ala Ser Asp 
385 390 

CCG ATC ATG ACC GAT GCG GAT ATG GCG ATA 
Pro lie Met Thr Asp Ala Asp Met Ala lie 
405 410 

GGC GCT ATC TGC GAA AAA TTC ATG GCC GAT 
hiq Ala He Cys Glu Lys Phe Met Ala Asp 
u3 420 425 

A£T TTC GCG AAG GCG TGG TTC AAG CTG ACG 
Tfir Phe Ala Lys Ala Trp Phe Lys Leu Thr 
FU 435 440 

|== 

AAA TCA CGT TAC ATC GGC CCG GAA GTG CCG 
£ffs Ser Arg Tyr He Gly Pro Glu Val Pro 
y] 450 455 

GAA GAC CCG ATT CCG GCA GGT AAC ACC GAC 
Gin Asp Pro He Pro Ala Gly Asn Thr Asp 
#5 470 

AAG CAG AAA ATT GCA CAA AGT GGC CTG AGC 
XMs Gin Lys He Ala Gin Ser Gly Leu Ser 
U3 485 490 

s - 
Ess; 

ACC GCT TGG GAC AGT GCC CGT ACT TAT CGC 
Thr Ala Trp Asp Ser Ala Arg Thr Tyr Arg 
500 505 

GGT GCT AAC GGT GCC CGC ATT CGC TTG GCC 
Gly Ala Asn Gly Ala Arg He Arg Leu Ala 
515 5*20 

GGC AAC GAG CCG GAG CGC CTG GCG AAA GTG 
Gly Asn Glu Pro Glu Arg Leu Ala Lys Val 
530 535 

ATC TCT GCC GAC ACC GGC GCT AGC ATC GCG 
He Ser Ala Asp Thr Gly Ala Ser He Ala 
545 550 

GGT AGC GTA GGC ATC GAG AAA GCC GCG AAA 
Gly Ser Val Gly He Glu Lys Ala Ala Lys 
565 570 

CGC GTT CCC TTC CTG AAA GGC CGT GGC GAT 
Arg Val Pro Phe Leu Lys Gly Arg Gly Asp 
580 58S 



CAG GGC AAG GCA AGC AAC 
Gin Gly Lys Ala Ser Asn 
335 

ACC ACC AAC CCC ACG AAA 
Thr Thr Asn Pro Thr Lys 
350 

GGC TAC AAT TGG GAA CTG 
Gly Tyr Asn Trp Glu Leu 
365 

GAA CCG ATT GAC ATC AAA 
Glu Pro He Asp He Lys 
380 

CCC TCT ATT CGC CAC AAC 
Pro Ser He Arg His Asn 
395 400 

AAG GTA AAT CCG ACC TAT 
Lys Val Asn Pro Thr Tyr 
415 

CCT GAG TAC TTC AAG AAA 
Pro Glu Tyr Phe Lys Lys 
430 

CAC CGT GAC CTG GGC CCG 
His Arg Asp Leu Gly Pro 
445 

GCA GAA GAC CTG ATT TGG 
Ala Glu Asp Leu He Trp 
460 

TAC TGC GAA GAA GTG GTC 
Tyr Cys Glu Glu Val Val 
475 480 

ATT AGT GAG ATG GTC TCC 
He Ser Glu Met Val Ser 
49S 

GGT TCC GAT ATG CGC GGC 
Gly Ser Asp Met Arg Gly 
510 

CCA CAG AAC GAG TGG CAG 
Pro Gin Asn Glu Trp Gin 
525 

CTG AGC GTC TAC GAG CAG 
Leu Ser Val Tyr Glu Gin 
540 

GAC GTG ATC GTT CTG GCC 
Asp Val He Val Leu Ala 
555 560 

GCA GCA GGT TAC GAT GTG 
Ala Ala Gly Tyr Asp Val 
S75 

GCG ACC GCC GAG ATG ACC 
Ala Thr Ala Glu Met Thr 
S90 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



GAC GCA GAC TCC TTC GCA CCG CTG GAG CCG CTG GCC GAT GGC TTC CGC 1824 
Asp Ala Asp Ser Phe Ala Pro Leu Glu Pro Leu Ala Asp Gly Phe Arg 
595 600 60S 

AAC TGG CAG AAG AAA GAG TAT GTG GTG AAG CCG GAA GAG ATG CTG CTG 1872 
Asn Trp Gin Lys Lys Glu Tyr Val Val Lys Pro Glu Glu Met Leu Leu 
610 615 620 

GAT CGT GCG CAG CTG ATG GGC TTA ACC GGC CCG GAA ATG ACC GTG CTG 1920 
Asn Arcr Ala Gin Leu Met Gly Leu Thr Gly Pro Glu Met Thr Val Leu 
625 630 635 640 

CTG GGC GGT ATG CGC GTA CTG GGC ACC AAC TAT GGT GGC ACC AAA CAC 1968 
Leu Gly Gly Met Arg Val Leu Gly Thr Asn Tyr Gly Gly Thr Lys His 
645 650 655 

GGC GTA TTC ACC GAT TGT GAA GGC CAG TTG ACC AAC GAC TTT TTT GTG 2016 
Glv Val Phe Thr Asp Cys Glu Gly Gin Leu Thr Asn Asp Phe Phe Val 
655 660 665 

AAC CTG ACC GAT ATG GGG AAC AGC TGG AAG CCG GTA GGT AGC AAC GCC 2064 
Asn Leu Thr Asp Met Gly Asn Ser Trp Lys Pro Val Gly Ser Asn Ala 
670 675 680 

TAC GAA ATC CGC GAC CGC AAG ACC GGT GCC GTG AAG TGG ACC GCC TCG 2112 
Tyr^Glu He Arg Asp Arg Lys Thr Gly Ala Val Lys Trp Thr Ala Ser 
^685 690 700 

CG€0GTG GAT CTG GTA TTT GGT TCC AAC TCG CTA CTG CGC TCT TAC GCA 2160 
ArgjVal Asp Leu Val Phe Gly Ser Asn Ser Leu Leu Arg Ser Tyr Ala 
70ll 710 715 720 

GAk^GTG TAC GCC CAG GAC GAT AAC GGC GAG AAG TTC GTC AGA GAC TTC 2208 
GlSVal Tyr Ala Gin Asp Asp Asn Gly Glu Lys Phe Val Arg Asp Phe 
~"' — " 730 735 



725 



GTC GCC GCC TGG ACC AAA GTG ATG AAC GCC GAC CGT TTC GAC GTC GCG 2256 
VaOAla Ala Trp Thr Lys Val Met Asn Ala Asp Arg Phe Asp Val Ala 
~ 740 745 750 



TC€HTAA 
Seal] 



(25^ INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 753 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Asn Ala Ser Ala Asp Asp Leu His Ser Ser Leu Gin Gin Arg 

5 10 - 15 

Cvs Arq Ala Phe Val Pro Leu Val Ser Pro Arg His Arg Ala He Arg 
20 25 30 

Glu Arg Ala Met Ser Gly Lys Cys Pro Val Met His Gly Gly Asn Thr 
35 40 45 



2262 



Ser Thr Gly Thr Ser Asn Lys Asp Trp Trp Pro Glu Gly Leu Asn Leu 
50 55 60 

Asp He Leu His Gin Gin Asp Arg Lys Ser Asp Pro Met Asp Pro Asp 
65 70 75 80 

Phe Asn Tyr Arg Glu Glu Val Arg Lys Leu Asp Phe Asp Ala Leu Lys 
85 90 95 

Lys Asp Val His Ala Leu Met Thr Asp Ser Gin Glu Trp Trp Pro Ala 
7 100 105 HO 

Asp Trp Gly His Tyr Gly Gly Leu Met He Arg Met Ala Trp His Ser 
115 120 125 

Ala Gly Thr Tyr Arg He Ala Asp Gly Arg Gly Gly Gly Gly Thr Gly 
130 135 .140 

Ser Gin Arg Phe Ala Pro Leu Asn Ser Trp Pro Asp Asn Val Ser Leu 
145 150 155 160 

Asp Lys Ala Arg Arg Leu Leu Trp Pro He Lys Lys Lys Tyr Gly Asn 
165 170 175 

Lvs He Ser Trp Ala Asp Leu Met He Leu Ala Gly Thr Val Ala Tyr 
L iso 185 190 

dfu Ser Met Gly Leu Pro Ala Tyr Gly Phe Ser Phe Gly Arg Val Asp 
03 195 200 205 

ife Trp Glu Pro Glu Lys Asp He Tyr Trp Gly Asp Glu Lys Glu Trp 
210 215 220 

iki Ala Pro Ser Asp Glu Arg Tyr Gly Asp Val Asn Lys Pro Glu Thr 
225 230 235 240 

Met Glu Asn Pro Leu Ala Ala Val Gin Met Gly Leu He Tyr Val Asn 
Q 245 250 255 

P^p Glu Gly Val Asn Gly His Pro Asp Pro Leu Arg Thr Ala Gin Gin 



260 



Vaa Leu Glu Thr Phe Ala Arg Met Ala Met Asn Asp Glu Lys Thr Ala 

' — * - ~s n f\ 285 



275- 280 



Ala Leu Thr Ala Gly Gly His Thr Val Gly Asn Cys His Gly Asn Gly 
290 295 300 

Asn Ala Ser Ala Leu Ala Pro Asp Pro Lys Ala Ser Asp Val Glu Asn 
305 310 315 ^ u 

Gin Gly Leu Gly Trp Gly Asn Pro Asn Met Gin Gly Lys Ala Ser Asn 
330 J Jb 



325 



Ala val Thr Ser Gly He Glu Gly Ala Trp Thr Thr Asn Pro Thr Lys 
340 345 350 

Phe Asp Met Gly Tyr Phe Asp Leu Leu Phe Gly Tyr Asn Trp Glu Leu 
355 360 365 

Lys Lys Ser Pro Ala Gly Ala His His Trp Glu Pro He Asp He Lys 
370 375 380 

Lys Glu Asn Lys Pro Val Asp Ala Ser Asp Pro Ser He Arg His Asn 
385 390 395 *00 



# 



Pro lie Met Thr Asp Ala Asp Met Ala lie Lys Val Asn Pro Thr Tyr 
405 410 415 

Arg Ala lie Cys Glu Lys Phe Met Ala Asp Pro Glu Tyr Phe Lys Lys 
420 425 430 

Thr Phe Ala Lys Ala Trp Phe Lys Leu Thr His Arg Asp Leu Gly Pro 
435 440 445 

Lys Ser Arg Tyr He Gly Pro Glu Val Pro Ala Glu Asp Leu He Trp 
450 455 460 

Gin Asp Pro He Pro Ala Gly Asn Thr Asp Tyr Cys Glu Glu Val Val 
465 470 475 480 

Lys Gin Lys He Ala Gin Ser Gly Leu Ser He Ser Glu Met Val Ser 
485 490 495 

Thr Ala Trp Asp Ser Ala Arg Thr Tyr Arg Gly Ser Asp Met Arg Gly 
500 505 510 

Gly Ala Asn Gly Ala Arg He Arg Leu Ala Pro Gin Asn Glu Trp Gin 
515 520 525 

Gly,. Asn Glu Pro Glu Arg Leu Ala Lys Val Leu Ser Val Tyr Glu Gin 
U530 535 540 

IlJUer Ala Asp Thr Gly Ala Ser He Ala Asp Val He Val Leu Ala 
54§p 550 555 560 

sU 

GlCUser Val Gly He Glu Lys Ala Ala Lys Ala Ala Gly Tyr Asp Val 
565 570 575 

ArlSval Pro Phe Leu Lys Gly Arg Gly Asp Ala Thr Ala Glu Met Thr 
Si 580 585 590 

As pi Ala Asp Ser Phe Ala Pro Leu Glu Pro Leu Ala Asp Gly Phe Arg 
5;: 595 600 605 

Asn? Trp Gin Lys Lys Glu Tyr Val Val Lys Pro Glu Glu Met Leu Leu 
Si 610 615 €20 

AsP Arg Ala Gin Leu Met Gly Leu Thr Gly Pro Glu Met Thr Val Leu 
62 ^ 630 635 640 

Leu Gly Gly Met Arg Val Leu Gly Thr Asn Tyr Gly Gly Thr Lys His 
645 650 655 

Glv Val Phe Thr Asp Cys Glu Gly Gin Leu Thr Asn Asp Phe Phe Val 
1 660 665 670 

Asn Leu Thr Aso Met Gly Asn Ser Trp Lys Pro Val Gly Ser Asn Ala 
67S " 680 685 

Tyr Glu He Arg Asp Arg Lys Thr Gly Ala Val Lys Trp Thr Ala Ser 
690 695 700 

Arg Val Asp Leu Val Phe Gly Ser Asn Ser Leu Leu Arg Ser Tyr Ala 
705 710 715 720 

Glu Val TVr Ala Gin Asp Asp Asn Gly Glu Lys Phe Val Arg Asp Phe 
725 730 735 

Val Ala Ala Trp Thr Lys Val Met Asn Ala Asp Arg Phe Asp Val Ala 
740 745 750 

Ser 




(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2238 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATG GAA AAT CAC AAA CAC TCA GGA TCT TCT ACG TAT AAC ACA AAC ACT 
Met Glu Asn His Lys His Ser Gly Ser Ser Thr Tyr Asn Thr Asn Thr 

5 10 15 

GGC GGA AAA TGC CCT TTT ACC GGA GGT TCG CTT AAG CAA AGT GCA GGT 
Glv Gly Lys Cys Pro Phe Thr Gly Gly Ser Leu Lys Gin Ser Ala Gly 
20 25 30 

GGC GGC ACC AAA AAC AGG GAT TGG TGG CCC AAC ATG CTC AAC CTC GGC 
Gly Gly Thr Lys Asn Arg Asp Trp Trp Pro Asn Met Leu Asn Leu Gly 
Q 35 40 45 

A fi TTA CGC CAA CAT TCA TCG CTA TCG GAC CCA AAC GAC CCG GAT TTT 
iS Leu Arg Gin His Ser Ser Leu Ser Asp Pro Asn Asp Pro Asp Phe 



SO 



55 60 



T& GGT CAT TAT GGC CCC TTC TTT ATA CGC ATG GCG TGG CAC AGC GCC 
tS Gly His Tyr Gly Pro Pne Phe He Arg Met Ala Trp His Ser Ala 

ry ioo 10S 

JgC ACC TAC CGT ATC GGT GAT GGC CGT GGT GGC GGT GGC TCC GGC TCA 
dfy £hr Tyr Arg He Gly Asp Gly Arg Gly Gly Gly Gly Ser Gly Ser 
1 iis 120 125 

CAG CGC TTC GCG CCT CTC AAT AGC TGG CCA GAC AAT GCC AAT CTG GAT 
Sn Arg Pne Ala Pro Leu Asn Ser Trp Pro Asp Asn Ala Asn Leu Asp 
130 135 "0 

AAA GCA CGC TTG CTT CTT TGG CCC ATC AAA CAA AAA TAC GGT CGA AAA 
£ti Si Sg Su Su Leu Trp Pro He Lys Gin Lys Tyr Gly Arg Lys 

145 150 155 

ATC TCC TGG GCG GAT CTA ATG ATA CTC ACA GGA AAC GTA GCT CTG GAA 
lie Ser Trp Ala Asp Leu Met He Leu Thr Gly Asn Val Ala Leu Glu 
165 l* 70 175 

ACT ATG GGC TTT AAA ACT TTT GGT TTT GCA GGT GGC AGA ^JCA 
Thr Met Gly Phe Lys Thr Phe Gly Phe Ala Gly Gly Arg Ala Asp Val 
180 185 190 

TGG GAG CCT GAA GAA GAT GTA TAC TGG GGA GCA GAA GAA ]£GG 

Trp Glu Pro Glu Glu Asp Val Tyr Trp Gly Ala Glu Thr Glu Trp Leu 
* 19S 200 205 



48 



96 



144 



192 



Ml TAT GCC GAA GAG TTT AAG AAG CTA GAT CTG GCA GCG GTT AAA AAG 240 
SSI Sr £a Glu Glu Phe Lys Lys Leu Asp Leu Ala Ala Val Lys Lys 
feS 70 7 5 

JaC CTG GCA GCG CTA ATG ACA GAT TCA CAG GAC TGG TGG CCA GCA GAT 
S£ Leu Ala Ala Leu Met Thr Asp Ser Gin Asp Trp Trp Pro Ala Asp 
= — 85 90 " 



288 



336 



384 



432 



480 



528 



576 



624 



GGA GAC AAG CGC TAT GAA GGT GAC CGA GAG CTC GAA AAT CCC CTG GGA 672 
Gly Asp Lys Arg Tyr Glu Gly Asp Arg Glu Leu Glu Asn Pro Leu Gly 
210 21S 220 

GCC GTA CAA ATG GGA CTC ATC TAT GTA AAC CCC GAA GGA CCC AAC GGC 720 
Ala Val Gin Met Gly Leu He Tyr Val Asn Pro Glu Gly Pro Asn Gly 
225 230 235 240 

AAG CCA GAC CCT ATC GCT GCT GCG CGT GAT ATT CGT GAG ACT TTT GGC 768 
Lys Pro Asp Pro He Ala Ala Ala Arg Asp He Arg Glu Thr Phe Gly 
245 250 255 

CGA ATG GCA ATG AAT GAC GAA GAA ACC GTG GCT CTC ATA GCG GGT GGA 816 
Arg Met Ala Met Asn Asp Glu Glu Thr Val Ala Leu He Ala Gly Gly 
260 265 270 

CAC ACC TTC GGA AAA ACC CAT GGT GCT GCC GAT GCG GAG AAA TAT GTG 864 
His Thr Phe Gly Lys Thr His Gly Ala Ala Asp Ala Glu Lys Tyr Val 
275 280 285 

GGC CGA GAG CCT GCC GCC GCA GGT ATT GAA GAA ATG AGC CTG GGG TGG 912 
Gly Arg Glu Pro Ala Ala Ala Gly He Glu Glu Met Ser Leu Gly Trp 
290 295 300 

AAA^AAC ACC TAC GGC ACC GGA CAC GGT GCG GAT ACC ATC ACC AGT GGA 960 
LyJHAsn Thr Tyr Gly Thr Gly His Gly Ala Asp Thr He Thr Ser Gly 
305P 310 315 320 

CTaJIgAA GGC GCC TGG ACC AAG ACC CCT ACT CAA TGG AGC AAT AAC TTT 1008 
Le\j^Glu Gly Ala Trp Thr Lys Thr Pro Thr Gin Trp Ser Asn Asn Phe 
lU 325 330 335 

e = 

TtSgAA AAC CTC TTT GGT TAC GAG TGG GAG CTT ACC AAA AGT CCA GCT 1056 
PhiKGlu Asn Leu Phe Gly Tyr Glu Trp Glu Leu Thr Lys Ser Pro Ala 
y! 340 345 350 

GGA^GCT TAT CAG TGG AAA CCA AAA GAC GGT GCC GGG GCT GGC ACC ATA 1104 
Gl^Ala Tyr Gin Trp Lys Pro Lys Asp Gly Ala Gly Ala Gly Thr He 

0=f 355 360 365 

CCGMGAT GCA CAT GAT CCC AGC AAG TCG CAC GCT CCA TTT ATG CTC ACT 1152 
ProSAsp Ala His Asp Pro Ser Lys Ser His Ala Pro Phe Met Leu Thr 
^370 375 380 

ACG GAC CTG GCG CTG CGC ATG GAC CCT GAT TAC GAA AAA ATT TCT CGA 1200 
Thr Asp Leu Ala Leu Arg Met Asp Pro Asp Tyr Glu Lys He Ser Arg 
385 390 395 400 

CGG TAC TAT GAA AAC CCT GAT GAG TTT GCA GAT GCT TTC GCG AAA GCA 1248 
Ara TVr Tyr Glu Asn Pro Asp Glu Phe Ala Asp Ala Phe Ala Lys Ala 
405 410 415 

TGG TAC AAA CTG ACA CAC AGA GAT ATG GGA CCA AAG GTG CGC TAC CTG 1296 
Trp Tyr Lys Leu Thr His Arg Asp Met Gly Pro Lys Val Arg Tyr Leu 
420 425 430 

GGA CCA GAA GTG CCT CAG GAA GAC CTC ATC TGG CAA GAC CCT ATA CCA 1344 
Gly Pro Glu Val Pro Gin Glu Asp Leu He Trp Gin Asp Pro He Pro 
435 440 445 

GAT GTA AGC CAT CCT CTT GTA GAC GAA AAC GAT ATT GAA GGC CTA AAA 1392 
Asp Val Ser His Pro Leu Val Asp Glu Asn Asp He Glu Gly Leu Lys 
450 455 460 

GCC AAA ATC CTG GAA TCG GGA CTG ACG GTA AGC GAG CTG GTA AGC ACG 1440 
Ala Lys He Leu Glu Ser Gly Leu Thr Val Ser Glu Leu Val Ser Thr 
465 470 475 480 



GCA TGG GOT TCT GCA TCT ACT TTT AGA AAC TCT GAC AAG CGC GGC GGT 1488 
Ala Trp Ala Ser Ala Ser Thr Phe Arg Asn Ser Asp Lys Arg Gly Gly 
485 490 500 

GCC AAC GGT GCA CGT ATA CGA CTG GCC CCA CAA AAA GAC TGG GAA GTA 1536 
Ala Asn Gly Ala Arg He Arg Leu Ala Pro Gin Lys Asp Trp Glu Val 
505 510 515 

AAC AAC CCT CAG CAA CTT GCC AGG GTA CTC AAA ACA CTA GAA GGT ATC 1584 
Asn Asn Pro Gin Gin Leu Ala Arg Val Leu Lys Thr Leu Glu Gly lie 
520 525 530 

CAG GAG GAC TTT AAC CAG GCG CAA TCA GAT AAC AAA GCA GTA TCG TTG 1632 
Gin Glu Asp Phe Asn Gin Ala Gin Ser Asp Asn Lys Ala Val Ser Leu 
535 540 545 

GCC GAC CTG ATT GTG CTG GCC GGC TGT GCG GGT GTA GAA AAA GCT GCA 1680 
Ala Asp Leu He Val Leu Ala Gly Cys Ala Gly Val Glu Lys Ala Ala 
550 555 560 565 

AAA GAT GCT GGC CAT GAG GTG CAG GTG CCT TTC AAC CCG GGA CGA GCG 
Lys Asp Ala Gly His Glu Val Gin Val Pro Phe Asn Pro Gly Arg Ala 
570 575 580 

GAT GCC ACC GCT GAG CAA ACC GAT GTG GAA GCT TTC GAA GCA CTA GAG 
ASb Ala Thr Ala Glu Gin Thr Asp Val Glu Ala Phe Glu Ala Leu Glu 
585 590 595 

CfeSl GCG GCT GAC GGC TTT AGA AAC TAC ATT AAA CCG GAG CAT AAA GTA 
pBo Ala Ala Asp Gly Phe Arg Asn Tyr He Lys Pro Glu His Lys Val 

1=1 1 600 605 610 

Tfe GCT GAG GAA ATG CTC GTA GAC CGG GCG CAG CTT CTG TCG CTT TCG 
Sir Ala Glu Glu Met Leu Val Asp Arg Ala Gin Leu Leu Ser Leu Ser 
in 615 620 625 

GCA CCA GAA ATG ACT GCT TTG GTA GGC GGT ATG CGT GTA CTG GGC ACC 
SL pS Glu Met Thr Ala Leu Val Gly Gly Met Arg Val Leu Gly Thr 

em 635 640 

A® TAC GAC GGT TCG CAG CAT GGA GTG TTT ACA AAT AAG CCG GGT CAG 
£E Tyi Aso Gly Ser Gin His Gly Val Phe Thr Asn Lys Pro Gly Gin 
Q * 650 655 * 6o0 

cfe TCC AAT GAC TTC TTT GTA AAC CTG CTA GAC CTC AAC ACT AAA TGG 
£eu Ser Asn Asp Phe Phe Val Asn Leu Leu Asp Leu Asn Thr Lys Trp 
665 670 675 

CGA GCC AGC GAT GAA TCA GAC AAA GTT TTT GAA GGC AGA GAC TT C AAA 
SSaSerAsp Glu Ser Asp Lys Val Phe Glu Gly Arg Asp Phe Lys 
3 680 685 690 

ACT GGC GAA GTA AAG TGG AGT GGC ACC CGG GTA GAC CTG ATC TTC GGA 
i5 Su Si Lys Trp Ser Gly Thr Arg Val Asp Leu He Phe Gly 

695 700 710 

TCC AAT TCC GAG CTA AGA GCC CTC GCA GAA GTG TAC GGC TGT GCA GAT 
Ser Asn Ser Glu Leu Arg Ala Leu Ala Glu Val Tyr Gly Cys Ala Asp 
71S 720 725 730 

TCT GAA GAA AAG TTT GTT AAA GAT TTT GTG AAG GCC TGG GCC AAA GTA 
Ser Glu Glu Lys Phe Val Lys Asp Phe Val Lys Ala Trp Ala Lys Val 
735 740 745 

ATG GAC CTG GAC CGG TTT GAT CTG AAA TAA 
Met Asp Leu Asp Arg Phe Asp Leu Lys 
750 75S 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2238 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 74 5 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Glu Asn His Lys His Ser Gly Ser Ser Thr Tyr Asn Thr Asn Thr 

5 10 15 

Gly Gly Lys Cys Pro Phe Thr Gly Gly Ser Leu Lys Gin Ser Ala Gly 
20 25 30 

Gly Gly Thr Lys Asn Arg Asp Trp Trp Pro Asn Met Leu Asn Leu Gly 
35 40 45 

He Leu Arg Gin His Ser Ser Leu Ser Asp Pro Asn Asp Pro Asp Phe 
50 55 60 

As© TVr Ala Glu Glu Phe Lys Lys Leu Asp Leu Ala Ala Val Lys Lys 
G® 70 75 80 

AsS Leu Ala Ala Leu Met Thr Asp Ser Gin Asp Trp Trp Pro Ala Asp 
-~= — 90 95 



85 



Tvfe Glv His Tyr Gly Pro Phe Phe lie Arg Met Ala Trp His Ser Ala 
% 100 10 S no 

Gli Thr Tyr Arg He Gly Asp Gly Arg Gly Gly Gly Gly Ser Gly Ser 
/ 11S 120 I 25 

GlS Arg Phe Ala Pro Leu Asn Ser Trp Pro Asp Asn Ala Asn Leu Asp 
yj 130 135 I 40 

Lyji Ala Arg Leu Leu Leu Trp Pro He Lys Gin Lys Tyr Gly Arg Lys 

i4S 150 

lU Ser Trp Ala Asp Leu Met He Leu Thr Gly Asn Val Ala Leu Glu 
170 1 



165 



Thr Met Gly Phe Lys Thr Phe Gly Phe Ala Gly Gly Arg Ala Asp Val 

180 1 QS 

Trp Glu Pro Glu Glu Asp Val Tyr Trp Gly Ala Glu Thr Glu Trp Leu 
195 200 205 

Gly Asp Lys Arg Tyr Glu Gly Asp Arg Glu Leu Glu Asn Pro Leu Gly 
210 21 s 220 

Ala Val Gin Met Gly Leu He Tyr Val Asn Pro Glu Gly Pro Asn Gly 
225 230 235 

Lys Pro Asp Pro He Ala Ala Ala Arg Asp He Arg Glu Thr Phe Gly 
1 245 250 255 

Arg Met Ala Met Asn Asp Glu Glu Thr Val Ala Leu He Ala Gly Gly 
3 260 265 270 

His Thr Phe Gly Lys Thr His Gly Ala Ala Asp Ala Glu Lys Tyr Val 
275 280 285 



Gly Arg Glu Pro Ala Ala Ala Gly He Glu Glu Met Ser Leu Gly Trp 
290 295 300 

Lys Asn Thr Tyr Gly Thr Gly His Gly Ala Asp Thr He Thr Ser Gly 
305 310 315 320 

Leu Glu Gly Ala Trp Thr Lys Thr Pro Thr Gin Trp Ser Asn Asn Phe 
325 330 335 

Phe Glu Asn Leu Phe Gly Tyr Glu Trp Glu Leu Thr Lys Ser Pro Ala 
340 345 350 

Gly Ala Tyr Gin Trp Lys Pro Lys Asp Gly Ala Gly Ala Gly Thr He 
355 360 365 

Pro Asp Ala His Asp Pro Ser Lys Ser His Ala Pro Phe Met Leu Thr 
370 375 380 

Thr Asp Leu Ala Leu Arg Met Asp Pro Asp Tyr Glu Lys He Ser Arg 
385 390 395 400 

Ara TVr Tyr Glu Asn Pro Asp Glu Phe Ala Asp Ala Phe Ala Lys Ala 
a 1 405 410 415 

Trr> Tvr Lys Leu Thr His Arg Asp Met Gly Pro Lys Val Arg Tyr Leu 
- 2 1 425 430 



420 



Gl^Pro Glu Val Pro Gin Glu Asp Leu He Trp Gin Asp Pro He Pro 
*-n 435 440 445 



Astral Ser His Pro Leu Val Asp Glu Asn Asp He Glu Gly Leu Lys 
-450 455 460 



AlaXLys He Leu Glu Ser Gly Leu Thr Val Ser Glu Leu Val Ser Thr 
465^ 470 475 

Ala 5 Trp Ala Ser Ala Ser Thr Phe Arg Asn Ser Asp Lys Arg Gly Gly 
Q 485 490 495 

AlfiAsn Gly Ala Arg lie Arg Leu Ala Pro Gin Lys Asp Trp Glu Val 

h=s 500 505 510 

AsnTAsn Pro Gin Gin Leu Ala Arg Val Leu Lys Thr Leu Glu Gly He 
U. SIS 520 S2S 

Gin Glu Asp Phe Asn Gin Ala Gin Ser Asp Asn Lys Ala Val Ser Leu 
530 535 540 

Ala Asp Leu He Val Leu Ala Gly Cys Ala Gly Val Glu Lys Ala Ala 

545 550 555 

Lys Asp Ala Gly His Glu Val Gin Val Pro Phe Asn Pro Gly Arg Ala 
1 565 570 =75 

Asp Ala Thr Ala Glu Gin Thr Asp Val Glu Ala Phe Glu Ala Leu Glu 
580 S8S 590 

Pro Ala Ala Asp Gly Phe Arg Asn Tyr lie Lys Pro Glu His Lys Val 

600 605 



595 



Ser Ala Glu Glu Met Leu Val Asp Arg Ala Gin Leu Leu Ser Leu Ser 
610 61S 620 

Ala Pro Glu Met Thr Ala Leu Val Gly Gly Met Arg Val Leu Gly Thr 

625 630 635 640 



Asn Tyr Asp Gly 



Leu Ser Asn Asp 
660 

Arg Ala Ser Asp 
675 

Thr Gly Glu Val 
690 

Ser Asn Ser Glu 
705 

Ser Glu Glu Lys 



Met Asp Leu Asp 
740 



Ser Gin His Gly 
645 

Phe Phe Val Asn 



Glu Ser Asp Lys 
680 

Lys Trp Ser Gly 
695 

Leu Arg Ala Leu 
710 

Phe Val Lys Asp 
725 

Arg Phe Asp Leu 



Val Phe Thr Asn 
650 

Leu Leu Asp Leu 
665 

Val Phe Glu Gly 



Thr Arg Val Asp 
700 



Ala Glu Val Tyr 
715 

Phe Val Lys Ala 
730 

Lys 
745 



Lys Pro Gly Gin 
655 

Asn Thr Lys Trp 
670 

Arg Asp Phe Lys 
685 

Leu lie Phe Gly 



Gly Cys Ala Asp 
720 

Trp Ala Lys Val 
735 
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